fix: support system columns in dataset.take* operations #5722
Signed-off-by: Daniel Rammer <hamersaw@protonmail.com>
ACTION NEEDED: The PR title and description are used as the merge commit message. Please update your PR title and description to match the specification. For details on the error, please inspect the "PR Title Check" action.
Xuanwo
left a comment
Thank you @hamersaw for working on this! Only have a question.
looks like there are quite a few CI failures, could you fix those?
westonpace
left a comment
This looks good but can you add a few tests? Preferably python tests.
westonpace
left a comment
Awesome. Thanks for doing this. I'll merge when green.
I've added #5823 which describes some of the stuff we talked about externally.
Looks like some legitimate test failures in the new python tests
jackye1995
left a comment
Looks good to me!
Previously, "take*" operations did not support `_rowid`, `_rowoffset`, `_row_created_at_version`, and `_row_last_updated_at_version`. In this PR we add support for all of these columns.

We preserve these system columns through the initial schema projection so that they can be used to populate the correct flags when building the `ProjectionPlan` and `PhysicalProjection` structs.

- `_rowid` / `_rowaddr`: persisting these through to `ProjectionPlan` fields was enough to make them work
- `_rowoffset`: required additionally (1) stripping the `ROW_OFFSET` field from `ProjectionPlan` `requested_output_expr` and (2) manually injecting the column using `AddRowOffsetExec` (after exposing some methods publicly)
- `_row_created_at_version` / `_row_last_updated_at_version`: required piping through flags to `Fragment` readers.

Closes #5615.
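
The approach described above can be illustrated with a small, self-contained Python sketch. It is a toy model only and does not use or mirror the actual Lance code: `ToyProjectionPlan`, `plan_projection`, `toy_take`, and the `with_*` flag names are hypothetical names invented for illustration. The sketch shows the general idea of keeping system columns through the projection step so they can be turned into flags, and of injecting `_rowoffset` in a separate step after the stored columns are read. It assumes a dataset with no deleted rows, so a row's offset equals its index.

```python
from dataclasses import dataclass, field

# Map each system column named in the PR description to a flag on the toy plan.
# The flag names are hypothetical, chosen only for this illustration.
SYSTEM_COLUMN_FLAGS = {
    "_rowid": "with_row_id",
    "_rowaddr": "with_row_addr",
    "_rowoffset": "with_row_offset",
    "_row_created_at_version": "with_row_created_at_version",
    "_row_last_updated_at_version": "with_row_last_updated_at_version",
}


@dataclass
class ToyProjectionPlan:
    """Hypothetical stand-in; NOT Lance's actual ProjectionPlan."""
    data_columns: list = field(default_factory=list)
    with_row_id: bool = False
    with_row_addr: bool = False
    with_row_offset: bool = False
    with_row_created_at_version: bool = False
    with_row_last_updated_at_version: bool = False


def plan_projection(requested):
    """Split requested names into stored columns and system-column flags."""
    plan = ToyProjectionPlan()
    for name in requested:
        flag = SYSTEM_COLUMN_FLAGS.get(name)
        if flag is None:
            plan.data_columns.append(name)
        else:
            setattr(plan, flag, True)
    return plan


def toy_take(rows, indices, requested):
    """Read stored columns, then inject `_rowoffset` as a separate step."""
    plan = plan_projection(requested)
    out = []
    for i in indices:
        row = {name: rows[i][name] for name in plan.data_columns}
        if plan.with_row_offset:
            # Assumption: with no deleted rows, the offset equals the index.
            row["_rowoffset"] = i
        out.append(row)
    return out


rows = [{"id": 10, "text": "a"}, {"id": 20, "text": "b"}, {"id": 30, "text": "c"}]
result = toy_take(rows, [2, 0], ["id", "_rowoffset"])
```

In this toy, `_rowoffset` is never looked up in the stored rows; it is recognised during planning and added afterwards, which loosely parallels stripping the field from the requested output and injecting it later.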